USING CONTENT IDENTIFIERS TO DOWNLOAD CD-COVER
PICTURES TO REPRESENT AUDIO CONTENT ITEMS 

The present invention relates generally to interactive systems for storing
and/or playing audio content, and more particularly, to interactive systems that use content
identifiers to download pictorial items, such as CD cover pictures, to represent audio
content items.

Interactive systems are known in the art for storing and playing audio
content items, such as songs, speeches, spoken novels and the like. The interactive systems
can include a personal computer, a laptop computer, or audio content players such as a CD
player, an MP3 player, a personal digital assistant, or a radio receiver. The audio content can
be represented on a display screen with an icon. Generally, the icon must be custom made
or chosen from a plurality of default icons in the interactive system. Typically, in current
interactive systems, there is no way of graphically indicating a stored, downloading, or
playing audio content item to a user where the graphic indication is specifically related to the
audio content item.

Therefore it is an object of the present invention to provide methods and 
interactive systems that overcome these and other disadvantages associated with the prior 
art. 

Accordingly, a method for displaying a pictorial item corresponding to an
audio content item is provided. The method comprises: identifying one or more
characteristics of the audio content item; searching a first remote network for the 
corresponding pictorial item based on the identified one or more characteristics; retrieving 
the corresponding pictorial item from the first remote network; and displaying the pictorial 
item corresponding to the audio content item. 

The identifying can comprise: searching the audio content item for a portion
containing the one or more characteristics; and retrieving the one or more characteristics
from the portion. The portion can comprise an ID3 tag.

The identifying can comprise: searching a second remote network for the
one or more characteristics corresponding to the audio content item; and retrieving the one
or more characteristics from the second remote network. The second remote network can
contain a database having a plurality of audio content items and one or more corresponding
characteristics for each of the plurality of audio content items. The identifying can also
comprise analyzing the audio content item to obtain an audio fingerprint containing the one
or more characteristics.

The displaying can comprise downsizing the pictorial item into a thumbnail
icon either prior to or subsequent to the retrieving. The downsizing can
further comprise reducing a resolution of the pictorial item either prior to
or subsequent to the retrieving.

The method can further comprise storing the pictorial item subsequent to the
retrieving.

The identifying can be initiated automatically upon occurrence of an event.
The event can be one of storing the audio content item, playing the audio content item,
receiving a transmission of the audio content item, and downloading the audio content
item. The identifying can also be initiated by a manual indication from a user.

The displaying can be initiated automatically upon occurrence of an event.
The event can be one of storing the audio content item, playing the audio content item,
receiving a transmission of the audio content item, and downloading the audio content
item. The displaying can also be initiated by a manual indication from a user.

The pictorial item can be a CD cover corresponding to at least the audio
content item.

The audio content item can be a song track from a plurality of song tracks
corresponding to the CD cover, and the displaying can further comprise altering the CD
cover to indicate the song track. The altering can comprise displaying a numerical value
with the CD cover, the numerical value indicating a song track number of the song track.

Also provided is an apparatus for displaying a pictorial item corresponding
to an audio content item. The apparatus comprises: a modem operatively connected with
a remote network; a processor for identifying one or more characteristics of the audio
content item, searching the remote network for the corresponding pictorial item based on
the identified one or more characteristics, and retrieving the corresponding pictorial item
from the remote network; and a display for displaying the pictorial item corresponding
to the audio content item.

Also provided are a computer program product for carrying out the methods
of the present invention and a program storage device for the storage of the computer
program product therein.

These and other features, aspects, and advantages of the apparatus and
methods of the present invention will become better understood with regard to the 
following description, appended claims, and accompanying drawings where: 

Figure 1 is a schematic illustration of an embodiment of an
apparatus according to the present invention.

Figure 2 illustrates a display from the apparatus of Figure 1. 

Although this invention is applicable to numerous and various types of 
apparatus and audio content items, it has been found particularly useful in the environment 
of songs. Therefore, without limiting the applicability of the invention to songs, the 
invention will be described in such environment. Those skilled in the art will also 
appreciate that the audio content items can be speeches, spoken novels and the like. 

Referring now to Figure 1, an apparatus for displaying a pictorial item
corresponding to an audio content item is shown, the apparatus being generally referred to 
by reference numeral 100. Those skilled in the art will appreciate that the apparatus can be 
any device capable of reproducing audio content items and displaying a representative 
pictorial item. Examples of such apparatus 100 include, but are not limited to, a personal
computer, a laptop computer, and audio content players such as a CD player, MP3 player, a 
personal digital assistant, and a radio receiver. The apparatus 100 generally includes one 
or more speakers 102 for reproducing the audio content item and a display 104. As will be 
discussed below, the display 104 is capable of displaying a pictorial item that is 
representative of an audio content item, such as a corresponding CD cover. The display 
104 can be dedicated to such a function or perform other functions of the apparatus 100. 

The apparatus further has a processor 106 for carrying out the methods of 
the present invention. The processor 106 may be dedicated to such functions or perform 
other functions of the apparatus 100. The processor 106, either directly or indirectly 
through appropriate drivers, controls the speaker(s) 102 and display 104. The apparatus 
100 further has a storage device 108, such as a memory operatively connected to the 
processor for storing the audio content items. The storage device 108 may also store 
program instructions for carrying out the methods of the present invention as well as for 
carrying out other functions of the apparatus 100. Although the storage device 108 is
shown in Figure 1 as a single unit, those skilled in the art will appreciate that it can be
provided in multiple units and in different types of storage devices. For example, the
storage device 108 may be implemented as a hard drive for storing the audio content items 
and an integrated chip for storing program instructions. 

The apparatus can further include an input device 110 for inputting data or
instructions to the processor 106. The input device 110 can be any input device known in
the art, such as a keyboard/keypad, a mouse, a stylus pen, a rollerball, a speech recognition
system, a touch screen, one or more switches/button pads, a connector for uploading audio
content items from other devices, such as an MP3 player, and a CD/DVD drive. The
apparatus 100 also includes a communication device 112, such as a modem under the
control of the processor 106 and operatively connected to a remote network 114, such as
the Internet. The apparatus 100 can also include a radio receiver, shown schematically by
antenna 116. The radio receiver is under the control of the processor 106 and configured to
receive, demultiplex, and/or format any type of radio transmission known, such as FM,
AM, cable, and satellite.

Methods for displaying a pictorial item corresponding to an audio content 
item will now be described with regard to Figures 1 and 2. The methods assume, unless 
otherwise noted, that the audio content items are available to the apparatus 100, either by 
the input device 110, the remote network 114, or the radio receiver 116. The method
identifies one or more characteristics of the audio content item. The one or more
characteristics can be an author of the song, a performer of the song, a title of the song, a
corresponding album (CD) of which the song is a part, the year the song was released, a 
genre of the song and the like. 

After the one or more characteristics are identified, the apparatus searches
the remote network 114 for a corresponding pictorial item based on the identified one or
more characteristics. Specifically, the processor 106 instructs the modem 112 to contact the
remote network 114, retrieves the corresponding pictorial item from the remote network 114,
and displays the pictorial item corresponding to the audio content item on the display 104.
The remote network 114 can be the Internet, and more specifically, a web site having an
address on the Internet where a database of pictorial items, such as CD covers, is stored
for downloading. One such Internet web site is www.cover-search.com. For example, the
one or more characteristics identified can be the song title and artist of the audio content
item. The processor 106 can contact www.cover-search.com via the modem 112 and
supply the song title and artist for a search of the database. The processor 106 would then
retrieve a CD cover corresponding to the search results. If the search results in more than
one CD cover, more characteristics can be supplied to narrow the search, or all of the CD
covers resulting from the search can be retrieved and the user of the apparatus 100 can
choose among the CD covers retrieved for display. The displayed CD cover can then serve
as a link to the reproduction of the corresponding audio content item.
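The narrowing behaviour described above can be sketched as follows. This is only an illustration under stated assumptions: the cover database is stood in for by a hypothetical in-memory list of records, the field names ("title", "artist", "album", "year", "genre", "image") and the function names are invented for the example, and no real web site interface is implied. The search starts with song title and artist and adds further characteristics only while more than one cover remains.

```python
from typing import Dict, List, Sequence

Record = Dict[str, str]


def matches(record: Record, criteria: Record) -> bool:
    """True when every criterion equals the record's field, ignoring case."""
    return all(
        record.get(key, "").strip().lower() == value.strip().lower()
        for key, value in criteria.items()
    )


def search_covers(
    catalog: Sequence[Record],
    characteristics: Record,
    first: Sequence[str] = ("title", "artist"),
    extra: Sequence[str] = ("album", "year", "genre"),
) -> List[Record]:
    """Search a (hypothetical) cover catalog, narrowing while ambiguous."""
    # Start with the initial characteristics, e.g. song title and artist.
    criteria = {k: characteristics[k] for k in first if characteristics.get(k)}
    results = [r for r in catalog if matches(r, criteria)]
    # Supply more characteristics only while several covers still match.
    for key in extra:
        if len(results) <= 1:
            break
        if not characteristics.get(key):
            continue
        narrowed = [r for r in results if matches(r, {key: characteristics[key]})]
        # Assumption: if narrowing removes everything, keep the broader set
        # so that the user can still choose among the retrieved covers.
        if narrowed:
            results = narrowed
    return results
```

If the returned list still holds more than one record, all of them can be presented so that the user chooses the cover to display.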

The identification of the characteristics of the audio content item can
comprise searching the audio content item itself for a portion containing the characteristics
and retrieving the characteristics from the portion. The portion of the audio content item
can be an ID3 tag. An ID3 tag is 128 bytes of data added to the end of an audio content
file. The ID3 tag contains 3 bytes for the word "TAG," 30 bytes for the song title, 30 bytes
for the artist, 30 bytes for the name of the corresponding album, 4 bytes for the year of
release, 30 bytes for comments, and 1 byte for genre. Bytes not used are filled with a
binary 0 value. Also, the genre field contains a value that corresponds to a predefined list
of 80 genres (values 0-79). Where the audio content item contains such a tag, the processor
106 searches the audio content item for the tag. Where the tag is an ID3 tag, the processor
106 would look for the word "TAG" 128 bytes from the end of the audio content file. The
processor 106 would then retrieve the characteristics in each of the fields and use the same
to search the remote network 114 for a corresponding pictorial item. As discussed above,
the search can start using any one or combination of the characteristics, such as song title
and artist. Any of the other characteristics identified can be added to the search should the
search return no result or more than one result.
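The tag layout described above can be read with a few lines of code. The following is a minimal sketch, not a definitive implementation: the function and field names are illustrative, the marker is assumed to be the uppercase ASCII bytes "TAG", and the text fields are assumed to be Latin-1 encoded with zero padding.

```python
import struct
from typing import Dict, Optional, Union

ID3_TAG_SIZE = 128
# 3-byte marker, 30-byte title, artist and album, 4-byte year,
# 30-byte comment, 1-byte genre: 128 bytes in total, no padding ("<").
ID3_LAYOUT = struct.Struct("<3s30s30s30s4s30sB")


def _text(raw: bytes) -> str:
    """Drop the zero padding and decode a fixed-width text field."""
    return raw.split(b"\x00", 1)[0].decode("latin-1").strip()


def parse_id3_tag(data: bytes) -> Optional[Dict[str, Union[str, int]]]:
    """Return the fields of a tag found in the last 128 bytes, else None."""
    if len(data) < ID3_TAG_SIZE:
        return None
    marker, title, artist, album, year, comment, genre = ID3_LAYOUT.unpack(
        data[-ID3_TAG_SIZE:]
    )
    if marker != b"TAG":
        return None  # no tag present at the expected position
    return {
        "title": _text(title),
        "artist": _text(artist),
        "album": _text(album),
        "year": _text(year),
        "comment": _text(comment),
        "genre": genre,  # index into the predefined genre list
    }
```

For a file on disk, the function could be called with the file's contents, for example `parse_id3_tag(open(path, "rb").read())`, where `path` is whatever location the audio content item is stored at. A return value of None indicates that another identification route is needed.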

The identification of the characteristics of the audio content item can also
comprise searching a remote network 114 (different from or the same as the remote network
used to search for the pictorial item) for the one or more characteristics corresponding to
the audio content item and retrieving the one or more characteristics from the remote
network 114. In this case, the remote network can contain a database of a plurality of
audio content items and one or more corresponding characteristics for each of the plurality
of audio content items in the database. The remote network can be an Internet web site,
such as www.musicbrainz.org. Such web sites contain a database of music metadata that
can be used to identify audio content items. In this situation, the processor 106 contacts the
remote network via the modem 112 to determine one or more characteristics of the audio
content item and retrieves any results from the web site. The processor 106 would then use the
same to search the remote network 114 for a corresponding pictorial item similarly to that
described above.

The identification of the characteristics of the audio content item can also
comprise analyzing the audio content item itself to obtain an audio fingerprint containing
the one or more characteristics. Such audio fingerprinting is known in the art, such as that
disclosed in J.A. Haitsma, Ton Kalker, "A highly robust audio fingerprinting system",
2002, IRCAM; and T. Kalker, "Applications and Challenges for Audio Fingerprinting",
presentation at the 111th AES Convention, NY, in the "Watermarking versus
Fingerprinting" workshop, December 3, 2001, the disclosures of which are incorporated
herein by reference.
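The identification routes described above (a tag inside the audio content item, a lookup on a remote network, and an audio fingerprint) could be combined by trying them in turn until one yields characteristics. The sketch below only illustrates that idea. The order of the routes is an assumption, since the description does not prescribe one, and each route is represented by a hypothetical callable supplied by the caller.

```python
from typing import Callable, Dict, Iterable, Optional

Characteristics = Dict[str, str]
# A route takes the raw audio content item and returns characteristics,
# or None when it cannot identify the item (hypothetical interface).
Route = Callable[[bytes], Optional[Characteristics]]


def identify(audio: bytes, routes: Iterable[Route]) -> Optional[Characteristics]:
    """Try each identification route in order; return the first result."""
    for route in routes:
        found = route(audio)
        if found:
            return found
    return None  # the item could not be identified by any route
```

A caller might pass, in order, a tag reader, a remote metadata lookup, and a fingerprint-based lookup, so that the cheapest local route is attempted first.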

As discussed above, the pictorial item, such as a CD cover, is displayed on
the display 104 to represent the audio content item. Such CD covers are often available in
full size and at high resolution for printing. In such instances, it may be desirable to
downsize the pictorial item into a thumbnail icon either prior to being retrieved from the
remote network 114 or subsequent to the retrieving by the processor 106. Furthermore, the
downsizing can also include reducing the resolution of the pictorial item before displaying
it on the display 104. Similarly to the downsizing, the reduction in resolution can be done
either prior to being retrieved from the remote network 114 or subsequent to the retrieving
by the processor 106. The pictorial item can be retrieved and displayed on the display 104
without being saved to the memory 108, such as when a song is being played on a radio
apparatus 100. Alternatively, the pictorial item can be retrieved and stored in the memory
108 subsequent to the retrieving. In the latter case, the pictorial item is always available for
display.
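Downsizing subsequent to the retrieving can be sketched as follows. The example is a simplified illustration, assuming the retrieved cover has already been decoded into rows of RGB pixel tuples. It uses plain nearest-neighbour sampling, which is one possible technique and is not specified by the description, and the function names are invented for the example.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]
Image = List[List[Pixel]]  # rows of RGB pixels


def thumbnail_size(width: int, height: int, max_side: int) -> Tuple[int, int]:
    """Scale (width, height) to fit max_side, keeping the aspect ratio."""
    if width <= 0 or height <= 0 or max_side <= 0:
        raise ValueError("dimensions must be positive")
    scale = min(1.0, max_side / max(width, height))  # never enlarge
    return max(1, round(width * scale)), max(1, round(height * scale))


def downsize(image: Image, max_side: int) -> Image:
    """Return a thumbnail of the image using nearest-neighbour sampling."""
    src_h, src_w = len(image), len(image[0])
    dst_w, dst_h = thumbnail_size(src_w, src_h, max_side)
    return [
        [image[y * src_h // dst_h][x * src_w // dst_w] for x in range(dst_w)]
        for y in range(dst_h)
    ]
```

Downsizing prior to the retrieving would instead require the remote network to offer a reduced version of the cover, which saves transmission time but depends on what the remote database provides.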

The identification process can be initiated automatically upon occurrence of
an event, such as storing the audio content item in the memory 108, playing the audio
content item on the speaker 102, receiving a transmission of the audio content item from
the antenna 116 (or other transmission means), and downloading the audio content item
from the remote network 114. Alternatively, the identification process can be initiated by a
manual indication from a user, such as through the input device 110. The displaying of the
pictorial item can also be initiated automatically upon occurrence of an event, such as
storing the audio content item in the memory 108, playing the audio content item on the
speaker 102, receiving a transmission of the audio content item from the antenna 116 (or
other transmission means), and downloading the audio content item from the remote
network 114. The displaying of the pictorial item can also be initiated by a manual
indication from a user, such as through the input device 110. For example, if a command is
received by the processor 106 to download an audio content item, such as a song from the
Internet 114, the commencement of the download can initiate both the identification of one
or more characteristics of the audio content item, and the displaying of a representative
pictorial item found as a result of the identification of the characteristics. Similarly, if a
command is received by the processor 106 to tune to a certain frequency and an audio content
item, such as a song, is being transmitted on that frequency and reproduced on the speaker
102, the reproduction (playing) of the audio content item can initiate the identification and
display processes as described above.
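The event-driven initiation described above can be sketched as a small dispatcher. This is an illustrative sketch only: the event names mirror the events listed in the description, while the class name and the handler callable are hypothetical. The handler stands for whatever routine performs the identification and the displaying.

```python
from enum import Enum, auto
from typing import Callable, Iterable


class Event(Enum):
    STORED = auto()       # audio content item stored in memory
    PLAYING = auto()      # audio content item being played
    RECEIVED = auto()     # transmission of the item received
    DOWNLOADING = auto()  # download of the item commenced
    MANUAL = auto()       # manual indication from the user


class CoverTrigger:
    """Runs a handler when a configured event, or a manual request, occurs."""

    def __init__(self, handler: Callable[[str], None], auto_events: Iterable[Event]):
        self._handler = handler
        self._auto_events = frozenset(auto_events)

    def notify(self, event: Event, item: str) -> bool:
        """Invoke the handler if the event should initiate it."""
        if event is Event.MANUAL or event in self._auto_events:
            self._handler(item)
            return True
        return False
```

Configuring the trigger with `Event.DOWNLOADING`, for instance, corresponds to the example in which the commencement of a download initiates both the identification and the displaying.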

Where the audio content item is a song track from a plurality of song tracks
contained on an album, an indication can be displayed with the pictorial item to identify the
song track and to distinguish it from the other song tracks on the album. Each song track
on an album may not have a corresponding pictorial item. All song tracks identified with
an album may only have a single representative pictorial item, such as a CD cover.
Therefore, the methods of the present invention can alter the CD cover to indicate the song
track. Referring now to Figure 2, there is shown the display 104 of Figure 1 having
pictorial items 200 corresponding to audio content items. The display 104 further has a
pictorial item 202 in the form of a CD cover that is altered to indicate the song track
from the corresponding album. As shown in Figure 2 by pictorial item 202, the altering
can comprise displaying a numerical value 204 with the CD cover where the numerical
value indicates a song track number of the song track. For example, where the audio
content item is the third song track of an album, the pictorial item 202 can be altered to
overlay a number "3" on at least a portion of the pictorial item 202. The numerical value
204 can be positioned anywhere within the pictorial item 202 and can also be positioned
outside the pictorial item 202. Those skilled in the art will appreciate that other types of
alteration can be made to the pictorial item 202 to indicate the particular song track from
among a plurality of song tracks on an album, such as a title or abbreviated version of the
title or an additional pictorial item.
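One way to model the altered CD cover is to pair the shared cover with a short label for each song track. The sketch below is an assumption-laden illustration: the data structure, the function names, and the abbreviation rule (truncating the title and appending a period) are invented for the example. It produces the text to be overlaid, such as the track number or an abbreviated title, and leaves the actual drawing to the display code.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass(frozen=True)
class DisplayItem:
    cover_id: str  # identifies the shared CD cover picture
    label: str     # text overlaid on or placed beside the cover


def track_label(
    track_number: Optional[int], title: Optional[str] = None, max_len: int = 10
) -> str:
    """Prefer the track number; otherwise fall back to a shortened title."""
    if track_number is not None:
        return str(track_number)
    if title:
        if len(title) <= max_len:
            return title
        return title[: max_len - 1].rstrip() + "."  # abbreviated title
    return ""


def items_for_album(cover_id: str, track_numbers: Sequence[int]) -> List[DisplayItem]:
    """One display item per song track, all sharing the same cover."""
    return [DisplayItem(cover_id, track_label(n)) for n in track_numbers]
```

For the third song track of an album, `track_label(3)` yields the label "3", matching the numerical value 204 in the example above.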

The methods of the present invention are particularly suited to be carried out
by a computer software program, such computer software program preferably containing
modules corresponding to the individual steps of the methods. Such software can of course
be embodied in a computer-readable medium, such as an integrated chip or a peripheral
device.

While there has been shown and described what is considered to be 
preferred embodiments of the invention, it will, of course, be understood that various 
modifications and changes in form or detail could readily be made without departing from 
the spirit of the invention. It is therefore intended that the invention not be limited to the
exact forms described and illustrated, but should be construed to cover all modifications
that may fall within the scope of the appended claims.